Picture for Christos Kozyrakis

Christos Kozyrakis

Accelerating Mixture-of-Experts Training with Adaptive Expert Replication

Add code
Apr 28, 2025
Viaarxiv icon

Efficient GNN Training Through Structure-Aware Randomized Mini-Batching

Add code
Apr 25, 2025
Viaarxiv icon

AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution

Add code
Nov 05, 2024
Figure 1 for AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution
Figure 2 for AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution
Figure 3 for AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution
Figure 4 for AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution
Viaarxiv icon

Cloud Atlas: Efficient Fault Localization for Cloud Systems using Language Models and Causal Insight

Add code
Jul 11, 2024
Viaarxiv icon

SlipStream: Adapting Pipelines for Distributed Training of Large DNNs Amid Failures

Add code
May 22, 2024
Figure 1 for SlipStream: Adapting Pipelines for Distributed Training of Large DNNs Amid Failures
Figure 2 for SlipStream: Adapting Pipelines for Distributed Training of Large DNNs Amid Failures
Figure 3 for SlipStream: Adapting Pipelines for Distributed Training of Large DNNs Amid Failures
Figure 4 for SlipStream: Adapting Pipelines for Distributed Training of Large DNNs Amid Failures
Viaarxiv icon

cedar: Composable and Optimized Machine Learning Input Data Pipelines

Add code
Jan 25, 2024
Figure 1 for cedar: Composable and Optimized Machine Learning Input Data Pipelines
Figure 2 for cedar: Composable and Optimized Machine Learning Input Data Pipelines
Figure 3 for cedar: Composable and Optimized Machine Learning Input Data Pipelines
Figure 4 for cedar: Composable and Optimized Machine Learning Input Data Pipelines
Viaarxiv icon

Efficiently Programming Large Language Models using SGLang

Add code
Dec 12, 2023
Viaarxiv icon

FlexShard: Flexible Sharding for Industry-Scale Sequence Recommendation Models

Add code
Jan 08, 2023
Viaarxiv icon

RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure

Add code
Nov 14, 2022
Figure 1 for RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure
Figure 2 for RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure
Figure 3 for RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure
Figure 4 for RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure
Viaarxiv icon

RecShard: Statistical Feature-Based Memory Optimization for Industry-Scale Neural Recommendation

Add code
Jan 25, 2022
Figure 1 for RecShard: Statistical Feature-Based Memory Optimization for Industry-Scale Neural Recommendation
Figure 2 for RecShard: Statistical Feature-Based Memory Optimization for Industry-Scale Neural Recommendation
Figure 3 for RecShard: Statistical Feature-Based Memory Optimization for Industry-Scale Neural Recommendation
Figure 4 for RecShard: Statistical Feature-Based Memory Optimization for Industry-Scale Neural Recommendation
Viaarxiv icon